Robust voice activity detection based on noise eigenspace
نویسندگان
چکیده
منابع مشابه
A Robust Voice Activity Detection Based on Noise Eigenspace Projection
A robust voice activity detector (VAD) is expected to increase the accuracy of ASR in noisy environments. This study focuses on how to extract robust information for designing a robust VAD. To do so, we construct a noise eigenspace by the principal component analysis of the noise covariance matrix. Projecting noise speech onto the eigenspace, it is found that available information with higher S...
متن کاملNoise Robust Voice Activity Detection
Voice activity detection (VAD) is a fundamental task in various speech-related applications, such as speech coding, speaker diarization and speech recognition. It is often defined as the problem of distinguishing speech from silence/noise. A typical VAD system consists of two core parts: a feature extraction and a speech/ non-speech decision mechanism. The first part extracts a set of parameter...
متن کاملOn Noise Robust Voice Activity Detection
In this paper, we show that the performance of voice activity detection algorithms (VAD) can be highly dependent on the type of background noise and we introduce a new VAD algorithm that is based on relative energy measurements in different frequency bands. The obtained experimental results are compared to the results obtained with two other spectrumbased VADs and it is concluded that a VAD, co...
متن کاملNoise robust voice activity detection based on switching kalman filter
This paper addresses the problem of voice activity detection (VAD) in noisy environments. The VAD method proposed in this paper is based on a statistical model approach, and estimates statistical models sequentially without a priori knowledge of noise. Namely, the proposed method constructs a clean speech / silence state transition model beforehand, and sequentially adapts the model to the nois...
متن کاملSpeaker-Dependent Voice Activity Detection Robust to Background Speech Noise
In this paper, we proposed a speaker-dependent voice activity detection (VAD) algorithm that only extracts the speech period uttered by a target user. Based on our survey of the recognition error of real speech data collected in “VoiceTra,” which is a speech-to-speech translation system for smartphones, we found that many word insertion errors are caused by background speakers’ speech. Our VAD,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Acoustical Science and Technology
سال: 2007
ISSN: 1346-3969,1347-5177
DOI: 10.1250/ast.28.413